OCR exploration server

Automated system for text detection in individual video images

Internal identifier: 001806 (Main/Exploration); previous: 001805; next: 001807

Authors: Y. Du [United States]; C. I. Chang; P. D. Thouin

RBID : Pascal:03-0353866

Abstract

Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.
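
The first module's two outputs, a coded image with per-pixel codes from 7 down to 0 and a binary thresholded image, can be illustrated with a minimal sketch. The abstract does not say how MPCM assigns priorities, so the code below substitutes a plain uniform 3-bit quantization of grayscale intensity; the function names `priority_codes` and `threshold_codes` and the `min_code` cutoff are hypothetical and are not taken from the paper.

```python
# Illustrative stand-in only: the abstract does not specify how MPCM assigns
# priorities, so a uniform 3-bit quantization of intensity is assumed here.

def priority_codes(gray, levels=8, max_value=255):
    """Map each 0..max_value intensity to an integer code in 0..levels-1."""
    return [[(pixel * levels) // (max_value + 1) for pixel in row] for row in gray]


def threshold_codes(codes, min_code):
    """Build a binary image: 1 where the code is at least min_code, else 0."""
    return [[1 if code >= min_code else 0 for code in row] for row in codes]


# A tiny 2x3 grayscale image (made-up values for demonstration).
image = [
    [0, 40, 200],
    [255, 128, 31],
]

codes = priority_codes(image)             # coded image, values in 0..7
mask = threshold_codes(codes, min_code=6)  # candidate regions marked with 1

print(codes)
print(mask)
```

In this sketch the pixels with the highest codes survive the threshold and would be handed to later stages as candidate regions; the real system's coding and threshold rules may differ substantially.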


Affiliations:


Links toward previous steps (curation, corpus...)


The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Automated system for text detection in individual video images</title>
<author>
<name sortKey="Du, Y" sort="Du, Y" uniqKey="Du Y" first="Y." last="Du">Y. Du</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Univ. of Maryland Baltimore County Remote Sensing Signal Image Proc Lab Dept. of Comp. Sci. and Elec. Eng.</s1>
<s2>Baltimore, MD 21250</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Chang, C I" sort="Chang, C I" uniqKey="Chang C" first="C. I." last="Chang">C. I. Chang</name>
</author>
<author>
<name sortKey="Thouin, P D" sort="Thouin, P D" uniqKey="Thouin P" first="P. D." last="Thouin">P. D. Thouin</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">03-0353866</idno>
<date when="2003">2003</date>
<idno type="stanalyst">PASCAL 03-0353866 EI</idno>
<idno type="RBID">Pascal:03-0353866</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000608</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000183</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000571</idno>
<idno type="wicri:doubleKey">1017-9909:2003:Du Y:automated:system:for</idno>
<idno type="wicri:Area/Main/Merge">001885</idno>
<idno type="wicri:Area/Main/Curation">001806</idno>
<idno type="wicri:Area/Main/Exploration">001806</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Automated system for text detection in individual video images</title>
<author>
<name sortKey="Du, Y" sort="Du, Y" uniqKey="Du Y" first="Y." last="Du">Y. Du</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Univ. of Maryland Baltimore County Remote Sensing Signal Image Proc Lab Dept. of Comp. Sci. and Elec. Eng.</s1>
<s2>Baltimore, MD 21250</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Chang, C I" sort="Chang, C I" uniqKey="Chang C" first="C. I." last="Chang">C. I. Chang</name>
</author>
<author>
<name sortKey="Thouin, P D" sort="Thouin, P D" uniqKey="Thouin P" first="P. D." last="Thouin">P. D. Thouin</name>
</author>
</analytic>
<series>
<title level="j" type="main">Journal of Electronic Imaging</title>
<title level="j" type="abbreviated">J Electron Imaging</title>
<idno type="ISSN">1017-9909</idno>
<imprint>
<date when="2003">2003</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Journal of Electronic Imaging</title>
<title level="j" type="abbreviated">J Electron Imaging</title>
<idno type="ISSN">1017-9909</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Automatic target recognition</term>
<term>Color</term>
<term>Color video images</term>
<term>Feature extraction</term>
<term>Image analysis</term>
<term>Image segmentation</term>
<term>Information retrieval</term>
<term>Multistage pulse code modulation</term>
<term>Optical character recognition</term>
<term>Text detection</term>
<term>Text processing</term>
<term>Theory</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Théorie</term>
<term>Analyse image</term>
<term>Traitement texte</term>
<term>Extraction caractéristique</term>
<term>Couleur</term>
<term>Segmentation image</term>
<term>Reconnaissance optique caractère</term>
<term>Recherche information</term>
<term>Reconnaissance automatique cible</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated system for text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>Maryland</li>
</region>
</list>
<tree>
<noCountry>
<name sortKey="Chang, C I" sort="Chang, C I" uniqKey="Chang C" first="C. I." last="Chang">C. I. Chang</name>
<name sortKey="Thouin, P D" sort="Thouin, P D" uniqKey="Thouin P" first="P. D." last="Thouin">P. D. Thouin</name>
</noCountry>
<country name="États-Unis">
<region name="Maryland">
<name sortKey="Du, Y" sort="Du, Y" uniqKey="Du Y" first="Y." last="Du">Y. Du</name>
</region>
</country>
</tree>
</affiliations>
</record>
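
The record above uses the prefixes `wicri:` and `inist:` without declaring them, which a namespace-aware XML parser will typically reject as unbound. The sketch below shows one possible way to read the title and authors with Python's standard `xml.etree.ElementTree`. The namespace URIs are placeholders invented for this example, the helper `parse_record` is hypothetical, and the embedded string is a trimmed excerpt of the record rather than the full document.

```python
import xml.etree.ElementTree as ET

# Trimmed excerpt of the record shown above; the full record has the same shape.
RECORD = """<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Automated system for text detection in individual video images</title>
<author>
<name sortKey="Du, Y" sort="Du, Y" uniqKey="Du Y" first="Y." last="Du">Y. Du</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s3>USA</s3>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Chang, C I" sort="Chang, C I" uniqKey="Chang C" first="C. I." last="Chang">C. I. Chang</name>
</author>
</titleStmt>
</fileDesc>
</teiHeader>
</TEI>
</record>"""

# Placeholder namespace URIs (assumptions, not official identifiers): they exist
# only so that the undeclared wicri: and inist: prefixes become bound.
ASSUMED_NS = 'xmlns:wicri="urn:example:wicri" xmlns:inist="urn:example:inist"'


def parse_record(text):
    """Bind the undeclared prefixes on the root element, then parse."""
    patched = text.replace("<record>", "<record " + ASSUMED_NS + ">", 1)
    return ET.fromstring(patched)


root = parse_record(RECORD)
title = root.findtext(".//titleStmt/title")
authors = [name.text for name in root.findall(".//titleStmt/author/name")]

print(title)
print(authors)
```

Patching the root element keeps the rest of the record untouched. Elements such as `titleStmt` carry no prefix, so they can be addressed by their plain names in the search paths.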

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001806 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001806 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:03-0353866
   |texte=   Automated system for text detection in individual video images
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024